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A simple microstructure return model 
explaining microstructure noise and Epps effects 



Abstract 

We present a simple microstructure model of financial returns that 
combines (i) the well-known ARFIMA process applied to tick-by-tick 
returns, (ii) the bid-ask bounce effect, (iii) the fat tail structure of the 
distribution of returns and (iv) the non-Poissonian statistics of inter- 
trade intervals. This model allows us to explain both qualitatively and 
quantitatively important stylized facts observed in the statistics of mi- 
crostructure returns, including the short-ranged correlation of returns, 
the long-ranged correlations of absolute returns, the microstructure 
noise and Epps effects. According to the microstructure noise effect, 
volatility is a decreasing function of the time scale used to estimate it. 
Paradoxically, the Epps effect states that cross correlations between 
asset returns are increasing functions of the time scale at which the 
returns are estimated. The microstructure noise is explained as the 
result of the negative return correlations inherent in the definition of 
the bid-ask bounce component (ii). In the presence of a genuine cor- 
relation between the returns of two assets, the Epps effect is due to 
an average statistical overlap of the momentum of the returns of the 
two assets defined over a finite time scale in the presence of the long 
memory process (i). 
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1 Introduction 



In the last decade, a lot of attention has been directed towards gathering 
empirical facts and developing theoretical understanding of the microstruc- 
ture (i.e., on tick-by-tick scales) characterizing the behavior of stocks prices, 
their corresponding returns and volatility (see for instance [H IH El H ESI 
[221 ES])- These investigations have revealed seemingly contradictory styl- 
ized facts in the statistical behavior of the dynamics of prices and returns. 
Let us mention the well-known short-ranged correlations of returns together 
with the long-ranged correlations of the absolute value of returns [H [10] , the 
microstructure noise effect of reahzed volatility [3 [23] and the Epps ef- 
fect of cross-correlations of returns [71 |TT] . The microstructure noise effect 
refers to the observation that volatility is a decreasing function of the time 
scale used to estimate it. In contrast, the Epps effect describes the fact that 
cross correlations between asset returns are increasing functions of the time 
scale at which the returns are estimated. These stylized facts are observed 
in a large variety of financial markets, including stocks, futures and options 
and they are thus considered as universal properties that are inherent to 
high-frequency financial markets data. However, there is still no theoretical 
model that can account simultaneously for the four mentioned stylized facts 
(see however Ref . [12] which emphasizes that the asynchrony of trades as well 
as the decimalization of stock prices are large contributors to the Epps effect. 
This paper also contains a review of previous papers attempting to explain 
the Epps effect). It is the purpose of the present paper to fill this gap and 
propose simple model for the microstructure of returns that describes these 
stylized facts both at a qualitative and quantitative level. 

Our model of tick-by-tick returns contains the following ingredients: (i) 
a long-memory centered Gaussian variable following an ARFIMA process; 
(ii) a sign bounce generalizing the bid-ask bounce effect governed by a quasi- 
periodic Bernouilli random variable; (iii) a random amplitude drawn from a 
fat-tail distribution; and (iv) the specification of the iid sequence of inter- 
trade intervals distributed according to a given probability density function 
chosen as a simple Weibull or a generalized gamma distribution. We show 
that the microstructure noise effect results simply from the negative return 
correlations inherent in the definition of the sign bounce component (ii). In 
the presence of a genuine correlation between the returns of two assets, the 
Epps effect appears due to an average statistical overlap of the momentum of 
the returns of the two assets defined over a finite time scale in the presence of 
the long memory process (i). In our microstructure framework, correlations 
between two assets correspond to buy or sell pressure exerted on both of 
them, but not exactly at the same time (except if the correlation is 1), due 
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to possible delays in implementation as well as herding effects between trad- 
ing decisions resulting from human decisions or algorithmic high-frequency 
trading. We stress that, notwithstanding this long memory (i), the sign 
bounce process (ii) ensures that auto-correlations of returns are short-lived, 
in agreement with empirical evidence. 

The presentation is organized as follows. Section 2 describes in details our 
model and derives the quantitative characteristics of the long ranged corre- 
lations of the absolute tick-by-tick returns. Section 3 discusses returns corre- 
lations in calendar time scale. Significant attention is given to the statistical 
description of inter-trade time intervals. Section 4 describes quantitatively 
the statistical properties of the microstructure noise effect in the frame of 
the proposed model. Then, Section 5 reveals the roots of the Epps effect, as 
suggested from our microstructure model. Two appendices provide detailed 
information on the derivation of the analytic expressions used in the main 
text. 



2 Model for the microstructure of financial 
returns at the tick scale 

2.1 Instantaneous returns vs tick- by-tick returns 

Let L{t) be the log-price of some asset. The corresponding return -Ra(^) 
as time scale A is defined as the increment of the log-price over the time 
interval {t — A,t) 

RA{t) = L{t) - L{t - A) . (1) 

In the present work, we develop a phenomenological theory of returns Ra {t) 
based on the instantaneous return R{t) defined as the derivative of log- price 
L(t): 

(2) 

Knowing the instantaneous returns, the A-scale returns -Ra(^) are obtained 
in integral form 

RA{t) = [ R{t')dt' . (3) 



Jt-A 

We consider the situation where the instantaneous returns are delta-pulses 
of the form 

R{t) = Y,rk5{t-tk) ■ (4) 

k 



4 



Expression (jl]) reflects the discrete nature of tlie process generating clianges 
of log-prices, which occur by jumps {r^} at the ordered tick-by-tick instants 
{tk}: 

■ ■ ■ < tk-i <tk< tk+i ... (to < 0, ti > 0) . (5) 

We suggest below that the sequences {tk} and {vk} are mutually sta- 
tistically independent. Moreover, we assume that the sequences {t^} and 
{rk} are stationary in the sense that the statistical properties of inter-trade 
intervals 

Tk,m tfc+m ^fc (6) 

and of pairs {rk+rn,rk) depend only on m. 

The instantaneous return R{t) and A-scale returns RA{t) are defined in 
the calendar time scale, where the time is measured in units of seconds, hours, 
years, and so on. In particular, expressions ([3]) and (jlj) are representations of 
returns and of instantaneous returns in the calendar time scale. Sometimes, 
it may be more convenient to use the tick-by-tick time scale, where one mea- 
sures time in subsequent trade numbers. In this sense, the sequence {rk} 
is the representation of instantaneous returns in the tick-by-tick time scale. 
Hereinafter, we shall discuss in details the interrelation between the instan- 
taneous return R{t) given by (jlj) defined in calendar time and the returns 
{rk} in tick-by-tick time. 



2.2 Formulation of the model of tick-by-tick returns 

Let us now define a simple stochastic model of tick-by-tick returns {rk} 
of the form 

rk := XkMk/Hk , (7) 

which captures both the short-range correlation of returns and the long-range 
correlation of the absolute value of returns. The sequence {Xk} is generated 
by a standard ARFIMA process [131 E] satisfying the discrete fractional 
difference equation 

{l~TfXk = T-Uk, de (0,1/2), (8) 

where T is the unit tick-lag operator, the m^'s are iid random variables with 
zero mean and unit variance, T is a normalizing constant such that po = 1 
where pm is the correlation function of the ARFIMA process 

Pm '■= E [XkXk+m] ■ (9) 

Bearing in mind the bid-ask bounce effect (see for instance [21 [IB]), the factor 
Mk in expression is equal to 

Mk := i-l)^" , (10) 
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where 



k 



(11) 



s=l 



The sequence {ts} takes into account the buy-sell structure of subsequent 
trades (ticks) in the following sense: if the {s — l)-th and s-th trades are 
the same (i.e. both are selling or buying), then ts = 0; in the opposite 
cases, is = I- We assume in our model that the sequence {is} consists of iid 
random Bernoulli integers, equal to one with probability p and zero with the 
complementary probability q = 1 ~ p. The probability q can be interpreted 
as the measure of distortion with respect to the periodic reference buy-sell 
process. Indeed, the closer q is to zero, the more likely a buy (sell) trade 
will be followed by a sell (buy) order. At g = 0, the sequences {M^} and 
{r^} have their signs being exact periodic functions of k, reflecting a perfect 
periodic buy-sell process that mimics a perfect bid-ask bounce. 

Definition 1 For short, we refer to the parameter q as the bounce distortion 
probability. 

The positive iid denominators {Hk} in expression ([7]) are chosen in such 
a way that the tick- by-tick returns {r^} possess a fat tail distribution similar 
to that found in empirical data, as specified in Appendix A. 3. 

Our model assumes that the sequences {X^}, {M^} and {H^} are mu- 
tually statistically independent. This implies in particular that the mean 
values of the tick-by-tick returns {vk} are all equal to zero. It is easy to 
generalize the theory developed below in the case where the means of {r^} 
are non-zero. 

2.3 Key properties 

In Appendix A.l, we show that the correlations {pm} defined by (Q of the 
ARFIMA process {X^} are described with excellent accuracy by the power 
law 



where a = 1 - 2rf G (0, 1). 

Appendix A. 2 derives that the correlation coefficients of the bounce fac- 
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tors {Mfc} are given by 



m 



E [Mk Mk^m] 



q := In 



= e 



—g m 




<q < 1/2, 
l/2<g<l, 



(13) 



The critical value Qc = 1/2 divides the interval q G (0, 1) into two parts, in 
which the behavior of the correlation coefficients Cm f|T3l) are qualitatively 
different. 

1. For g G (0, 1/2), Cm is a sign- alternating function of m. 

2. For q G (1/2, 1), Cm is a positive non-oscillating function of its argu- 



The existence of a sign-alternating behavior of Cm for small g's can be ex- 
pected from our previous remark that the closer q is to zero, the more likely 
a buy (sell) trade will be followed by a sell (buy) order. Note also that, 
the closer q is to the critical value qc = 1/2, the more short-ranged becomes 
Cm- Below, we discuss only the case q G (0,1/2), which is consistent with 
the known stylized fact that subsequent tick-by-tick returns and Vk+i are 
negatively correlated (see for instance [21 HE])- 

Appendix A. 3 describes the statistical properties that the iid sequence 
{iffc} need to obey in order for the distribution 0(r) of tick-by-tick returns 
{rfe} to be fat tailed. In the following, we suppose for definiteness that 0(r) 
is a Student distribution of the form 



2.4 Correlation of tick-by-tick returns 

Putting all the properties described in the previous subsection together, 
the correlation function of the tick-by-tick returns {r^} defined by ([7j) 



can be obtained explicitly. Taking into account the mutual statistical inde- 
pendence of the factors {Xk}, {Mk} and of the iid denominators {Hk}, and 



ment m. 




(14) 



where corresponds to a tail exponent /i. 



K-m ■= E [r^rfc+m] = E [XkXk+mMkMk+m/ HkHk+m] , 



(15) 
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using relations, ( IT^ . (fTSjl . one has 




m = *^^^) 
7(-l)™m-°, m ^ 1, 62 

where := E [-f^^^^j is the inverse moment of the denominators {Hk} given 
by expression (lA.lSp . 

Figure 1 shows the autocorrelation function fCm/62 for a = 0.1 and for 
different values of the bounce distortion probabihty q. 

2.5 Long-range correlations of absolute returns 

Consider now the autocorrelation function 





e 






[\rk 


1 


E 




2ej . 


- [In 





^"^w ■= ,' :r '^..^r' (17) 



of power 6 of the absolute values of the tick-by-tick returns. 

Using the quadratic approximation (1A.26P and relations (]A.18|) . (]A.20p . 
we can rewrite expression f lA.2ip in the form 



1, m = 0, 

AUd)-{ a:=2aG(0,2), (18) 

Xm^'^, m ^ 1, 

where the factor x is given by relation flA.28p . It follows from the analysis of 
Appendix A. 4 that the power law in f lTS]) constitutes an extremely accurate 
description of Am{(^) for a wide range of the parameters /i, 6 and for any 
a e (0,2). This is visualized in the log-log representation of figure 2, which 
shows the exact dependence of the autocorrelation function of the absolute 
returns. 

The main properties the correlation function (1181) of the absolute returns 
follows. 



1. Relation (|T8l) means that the correlations of the absolute returns are 
long-ranged if the exponent a is sufficiently small (less than 1 in order 
to technically qualify as "long memory" [B] ) . Empirically, typical values 
of the exponent are found in the range a G (0.2, 0.4) [S]. 

2. The exponent a is an empirically observable parameter as it can be eas- 
ily obtained from an appropriate statistical analysis of financial data. 
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This is in contrast with the exponent a of the correlation of returns 
given by flTB]) . which is unobservable due to the distortion of the bid-ask 
bounce that remove the information on this underlying power decay. 

As shown in Appendix A. 4, the power law decay of the correlation of 
the absolute returns has the same tail exponent o for any Q G (0.5, 1.5), 
for which the quadratic approximation (1A.26P of the function T[Q^ p) 
(1A.22P is highly accurate. 

The shape of the correlation function Am{&) as a function of Q does not 
depend on m and o but is different for different values of the exponent 
/i. The dependences on Q of the normalized autocorrelation function 

A(^) := -^"^f m > , (19) 

^ ' maxe^^(e) ^ ' 

for different values of the exponent /i of the Student distribution 0(r) 
f|T^ . are shown in figure 3. We observe a shape similar to the daily 
returns correlations shown in Ref. [TO]. 



3 Correlation of returns in calendar time scale 

In the previous sections, we have described the statistical properties of 
returns of our model in the tick-by-tick time scale. Henceforth, we present 
a study of the statistical properties of instantaneous returns R{t) ^ and of 
A-scale returns Ris.{f) (E]) in calendar time scale. This allows us to describe 
the microstructure noise and Epps effects. 



3.1 Correlation function of instantaneous returns 

The covariation function of instantaneous returns i?(t) (jl]) in the calendar 
time scale representation defined by K{t^ := E [_R(t)_R(t + r)] is obtained 
from the autocorrelation function Bm fll6p at the tick-by-tick time scale by 
relation 

K{T) = ve2W)^B{r)\ , (20) 

where 

oo 

B(r) = ^i3™/,„(|r|), (21) 

m=l 

£2 is the variance of the tick-by-tick returns, v is the mean rate of returns 
given by 

V = 1/f , f := E , (22) 
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while fm.i'T) (t > 0) is the probabihty density function of tick-by-tick time 
interval durations r^^m ©• A short outline of the derivation of relation fl?I]) 
is given in Appendix B.l. It is convenient to use the mean time duration f 
between subsequent trades as the unit time to scale all calendar times. 

Definition 2 In the following, we scale all time scales by the intertrade mean 
duration f, referring to this transformation as the "calendar time scale rep- 
resentation". 

In the calendar time scale representation, we have f = 1 and relation f l2Up 
takes the reduced form 

K{r) = e, [6{t) + B{r)] . (23) 

3.2 Spectrum of instantaneous returns 

In what follows, we need to evaluate the spectrum of function K{r) 

S2 r°° 

K{uj) := — / Ki^) cos{uT)dT . 



7r 



Using ([21]), (122]), we obtain 



27T 



l + Biu) 



where 



m=l 



(24) 



(25) 



(26) 



B{uj) =2 B{t) cos{uT)dT = 2Re 
Jo 

and fm{s) is the Laplace image of the probability density function /m(T) 

POO 

Lis) := / fm{r)e-'^dT. (27) 



Although some authors are found evidence of long-ranged correlations be- 
tween different inter-trade time intervals [U], we, along with Ref. [20], sug- 
gest henceforth that tick-by-tick instants {tk} are such that the random inter- 
trade durations {rt} are iid random variables with the same probability den- 
sity function /(r). 

Assertion 1 Our micro structure return model ([7]) of tick-by-tick returns is 
fully defined when supplemented by the specification of the iid sequence {t^} of 
inter-trade intervals distributed according to the probability density function 
fir). 
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Within our microstructure return model, the probabihty density functions 
fm{T) are equal to m-tiple convolutions of /(t). Accordingly, relation 
takes the form 



B(u) := 2Re 



m=l 



{2i 



Substituting the expression (|T6l) for the autocorrelation function of the tick- 
by-tick returns and after summing the series, we obtain 



2(1 - 2g)7 

r(i + a) 7o 



Re 



e«^/" + (1 - 2q)f{iuj) 



du. 



(29) 



The factor 7 is defined from (ITB]) with (1121) . 



3.3 Statistics of inter-trade time intervals 

In what follows, we use relation f l29|) to analyze quantitatively the mi- 
crostructure noise and Epps effects. The characteristics of these effects de- 
pend essentially on the statistics of the inter-trade time intervals. Under the 
condition that the inter-trade durations are iid, all information about their 
statistics is contained in the probability density function fij). We thus first 
discuss possible models for this probability density function. 

In the econometric literature, the statistics of inter-trade intervals has 
been studied empirically and parametric models have been proposed (see, 
for instance, [15], [T71 [I9|). Here, we discuss a few analytical models for 
the probability density functions /(t), which provide reasonable fits to the 
empirical distributions of inter-trade intervals while at same time possess 
a convenient analytical expression for their Laplace images f{s). Having 
such analytical approximations is very useful for the determination of the 
spectrum B{uj) (129|) . which is defined in terms of the Laplace image fiiuj) of 
the probability density function of inter-trade intervals expressed in terms of 
the imaginary argument iu. 

A model often used to represent the distribution of inter-trade intervals 
is the WeibuU distribution 

/(r) = \w{\t- /3), w{t- P) := ^r^-'e-^' , (30) 

where /3 is the shape parameter and A is the scaling parameter. The corre- 
sponding survival probability Q{t) := f{u)du is given by expression 

Q{t) = W{\t][5), W{t;I3):= w{u; /S) du = e'^" . (31) 
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In the reduced time scale representation in which the unit duration is taken 
as the mean inter-trade interval f, the Weibull distribution depends only 
on the shape parameter /3, while the scale parameter A is defined by the 
condition 

f = Q{T)dT = l A = A^(/3) := r (^^^ . (32) 

Figure 4 presents typical time sequences {tk} of trades with tk = J2t=i 
where the inter-trade intervals r^'s are drawn from the Weibull distribution 
with shape parameters (3 = 0.5; 1;2 respectively. Notwithstanding the mu- 
tual independence of the inter-trade intervals {t^}, one can observe clustering 
for /3 < 1 and quasi-periodicity for /3 > 1. 

However, the Weibull distribution family has some shortcoming. First, its 
Laplace image does not have a simple analytical representation for any value 
of p. Second, it has not enough shape parameters in order to fit the empirical 
probability density function over the whole range of r values for which the 
empirical probability density function is still measured accurately. Indeed, 
using (3 ~ 0.75 ^ 0.85, the Weibull distribution provides a rather accurate 
representation of the empirical distribution at small and intermediate values 
of r. But it goes to zero too fast for large r values, compared with the slow 
decay of the empirical probability density function [IT]. This justifies using 
some more flexible distributions, which are related to the Weibull family. 
One such possibility is the generalized gamma distribution (GGD) 

fir) = XgiXr; ^, /3), g{T; ^, /3) := e"-' . (33) 

Its survival probability is 

F (i^/0 t/^) 

Q{r) = G{XT-i!},/3), G{T-i!},P):=^r^^j^ , (34) 

where r{a,z) is the incomplete gamma function. The scale parameter A is 
given by 

so as to ensure that the mean time duration between trades is unity. 

From the point of view of the feasibility of analytical calculations of the 
integral ( 12^ . the GGD has the nice property of having analytical expressions 
for its Laplace images for any rational (3 and for arbitrary shape parameter 

Analytical expressions of the Laplace images of the GGD fl33|) . for some 
rational values of it shape parameter (3, are given in Appendix B.2. 
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Figure 5 shows the survival functions of the WeibuU distribution flHTl) for 
P = 0.8 and of the generahzed gamma distribution flMl) . for the same i} = 0.8 
and for /3 = 2/3, 1/2 and (3 = 1/3. Figure 6 compares the function B{lj) 
(129|) obtained for a Poissonian statistics of the tick- by-tick times {tk}, where 

fir) = e-^ ^ /(,) = _!_, (36) 

to the two functions B{u) for the GGDs for 6 = 0.8 and /3 = 1/2; 2/3, for 
which the Laplace images of the probability density function are given by 
the analytical expressions flRQj) . flRTOj) and dEH]), f lRTi]) . 



All curves represented in figure 6 coincide at = 0, which is the conse- 
quence of the fact that -B(O) does not depend on the shape of the distribution 
/(r) of the inter-trade interval durations. Note however the significant dif- 
ferences between the three realizations of B{u) at u ^ 0, which emphasizes 
the difference between the exponential probability density function and 
the GGD. 



3.4 Short-ranged correlations of discrete returns in cal- 
endar time 

In section 2, we have demonstrated that our microstructure return model 
predicts that the range of the correlations of the tick-by-tick returns is con- 
trolled essentially by the bounce distortion probability q (see figure 1). In 
particular, if q is close to zero, then the autocorrelation JCm ffTB]) of the tick- 
by-tick returns is a long-ranged (sign-alternating) function of m. In contrast, 
if q is close to the critical value go = 1/2, the correlation of the tick-by-tick 
returns becomes short ranged (as an illustration, see the upper and lower 
plots in figure 1). We now show that, even for small g's including q = 0, the 
correlation of the A-scale returns -Ra(^) (E]) in calendar time is short-ranged, 
as a result of the washing action of the random inter-trade intervals {t^}. 

To demonstrate this result, we rewrite the A-scale returns in the more 
convenient form 

i?A(t) = nA(t)®i?(t), IlA{t):=r (37) 

[0, t^(0,A), 

where the symbol ^ represents the convolution operation. Using fl23|) and 
fl57|) . the correlation function of the A-scale returns -Ra(^) is given by the 
convolution 

i^A(r) := E [i?A(t)i?A(t + 0)] = K{r) ® T^ir) = 

(38) 

£2 [6iT) + B{r)]®TAir) , 
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where 

^ . X [a - Irl, \d\ < A, 

^'^<^' {o, In > a! 

For the numerical estimation of the function K/^{t), we use relation 
(lB.17p . which is directly derived from relation fl55]) . Figure 7 shows the ratio 
Ka{t)/Kj\{0), for a = 1 and for different values of the bounce distortion 
probability q. One can observe the short range nature of the correlations of 
the A-scale returns in calendar time for all q values. Note the fact that the 
correlations become negative before decaying to zero. The results shown in 
figure 7 use the GGD ( 15^ for the distribution /(r) of the inter-trade inter- 
vals, with ■}) = 0.8 and /3 = 2/3. Analogous plots for q = and for different 
values of A are shown in figure 8. 



4 Microstructure noise effect 

4.1 Basic notions 

The goal of this section is to show that our model provides a natural set- 
up for the microstructure noise effect [H El [3l HI El [8]. The microstructure 
noise effect refers to the following phenomenon. Let us consider the realized 
volatilitjil], equal to 

. L^/AJ 

DiA,T) = - J2 RlikA). (40) 

k=l 

If -Ra(^) is a stationary and ergodic process, then the realized volatility f l40|) 
converges in probability as T goes to infinity to 

D{A) := [Rlit)] , (41) 

which is, by definition, the average volatility density over intervals of duration 
A. 

The geometric Brownian motion (GEM) L{t) = D ■ W{t) is the simplest 
and often used first-order model of price dynamics, where W(t) is the stan- 
dard Wiener process. The GBM is such that the volatility density -D(A) 
does not depend on A (-D(A) = D), so that its estimation is in principle 
independent of the time durations A. In addition, for GBM processes, the 

^In this context, volatility is defined as the variance of the log-price increments over a 
given time interval of duration A. 
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volatility is observable as, for A — t- and for any given observation interval 
of duration T = const, the realized volatility converges in probability to the 
volatility density D: 

D{A,T) D, A^O, T = const. (42) 

Real financial markets depart from the ideal GBM and exhibit the mi- 
crostructure noise effect, i.e., for small durations A that are comparable to 
the mean time interval f between subsequent trades, the realized volatil- 
ity is positively biased. Mathematically, this means that -D(A) defined by 
expression fHT]) increases as A decreases. 

Assertion 2 The next subsection demonstrates that the micro structure noise 
effect results simply from the negative return correlations inherent in the 
micro structure model proposed in the present paper. 



4.2 Strength of microstructure noise effect 

In order to describe quantitatively the microstructure noise effect, let us 
calculate the function -D(A) defined by expression ( BTj) in the frame of our 
microstructure returns model. Due to equality ( 155]) . one has 

D(A) = lirA(O) = ^2 (l + I U{r)B{r)dr^ . (43) 

Definition 3 We defined respectively by "true" and "microstructure" volatil- 
ities the following limit values 

Amc := lim D{A) , := lim D{A) . (44) 

A— s>oo A->-0 

The ratio 

S := (45) 

Dtrue 

defines the strength of the microstructure noise effect. 



The ratio S f l45|) has the following intuitive economic meaning. It is equal 
to the dimensionless ratio of the realized volatility -D(A) fHT]) at the micro 
(A —7- 0) and at the macro (A — t- oo) scales. This justifies that S provides a 
convenient quantitative characterization of the strength of the microstructure 
noise effect. 

The dependence of the strength S (H5|) of the microstructure noise effect 
with respect to the parameters of our microstructure noise model is obtained 
as follows. From expressions and ( 15^ . we obtain 

I^micro = £2, Arue = £2[1 + 5(0)], (46) 
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so that, 

S = 1 — . (47) 

1 + 5(0) ^ ^ 

In view of relation (1291), one has 



^ 2(l -2,h A. 

r(l + a) Jo e""" + 1 - 2« * ' 

Expression (147|) with fHHl) shows that iS does not depend on the shape of the 
probabihty density function /(r) of the inter-trade interval durations. It does 
depend on the other parameters (y,q,fi of the model, where a is the unob- 
servable exponent of the power law correlation 0121) of the auxiliary ARFIMA 
process {Xk}, q is the bounce distortion probability, and fi is the exponent 
of the power law describing the tail of the probability density function 0(r) 
(IT^ of the tick-by-tick returns. While a is not directly observable, it can 
be derived from the the observable exponent a of the power law dependence 
of the correlation flTSl) of the absolute tick-by-tick returns via the equality 
a = 2a. Figures 9-11 illustrate the dependence of the strength S{a,q,fi) of 
the microstructure noise effect on these three parameters. 

A further quantification of the strength of the microstructure noise effect 
is obtained by introducing the generalization of the strength S (H511 defined 

by 

5. §^ . (49) 

which now depends on the interval duration A over which the returns -Ra(^) 
are defined. Figure 12 illustrates the dependence of iSa for a = 0.1 (a = 0.2), 
fj, = 5 and for different values of the bounce distortion probability q. 



5 Epps effect 

5.1 Epps effect: basic notions 

The microstructure return model introduced in the present paper is able 
to identify the roots of the Epps effect [TT], El] , as we now demonstrate. 

For this, let us describe the Epps effect by using the notations introduced 
above. Let us consider the instantaneous returns Ri{t) and R2{t) of two 
assets. Analogously to (jSTj), the A-scale returns of the two assets during a 
time interval of duration A can be expressed with the relations 

RlAit) = nA(t) ® Rl{t), i?2,AW = ^A{t) ® R2{t) ■ (50) 
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The Epps effect is observed on a quantity tliat is analogous to fHTl) . except 
tliat under tlie expectation sign ttie square return R\{t) is replaced by the 
product of the two different returns -Ri,a(^) and R2,A(t) 

Di,2(A) := [i?l,A(t)i?2,A(t)] . (51) 

Z^i,2(A) is thus a measure of inter-dependence between the two assets. 

It is convenient to introduce a normalized version of i^i 2(A) (!5T|) . For 
this, we consider the simple case where the returns Ri{t) and -R2('^) are 
statistically equivalent in the sense that -Di(A) = D2{A) = D{A), where 

D,{A) := Ie [Rl^it)] , D2(A) := ^E [^^(t)] . (52) 

The normalized version of Di^2iA) flHT]) is defined as 

S'/ ■■= ^^j^, Aruc := lim D{A). (53) 
Now we are ready to formulate the Epps effect. 

Definition 4 Consider two assets whose A-scale returns -Ri,a(^) ond -R2,a(^) 
at scale A are correlated so that the following limit is positive 

lim 5^'^ > . (54) 

A—)- 00 

The Epps effect corresponds to the fact that S]^ flS^ is a monotonically 
increasing function of the argument A, and which vanishes as A — )■ 0. 



5.2 Epps effect paradox 

Before explaining the Epps effect in the frame of our microstructure re- 
turns model, it is illuminating to first discuss a paradox that emerges when 
comparing the dependence with A of the two similar functions 

Z}(A) = ^E [i?2^(t)] and Di,2(A) = ^E [i?i,A(t)i?2,A(t)] • (55) 

According to the microstructure noise effect discussed in the previous section, 
the first D{A) is a decreasing function of A while, according to the Epps 
effect, the second Di 2(A) is an increasing function of A. The paradox is 
that, while -R2,a(^) niay "only insignificantly" differ from -Ri,a(^), replacing 
one i?i,A(^) in the definition of -D(A) by R2,A{t), i.e., changing Rf ^(t) into 
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-Ri,a(^)-^2,a(^)5 change an increasing function into a decreasing function of 
A.' 

In order to elucidate this paradoxical bifurcation of the function -D(A) 
under the change of Rl ^(t) into -Ri,a(^)-R2,a(^), let us consider, analogously 
to (jl]), the instantaneous returns 

Ri{t) = J2n,kS{t-h,k)- (56) 

k 

This defines a singular stochastic process with infinite mean: 

E [Rl{t)] = oo. (57) 
Consider now another instantaneous returns process 

i?2(t) = 5^r2,fc 5(t-t2,fc) , (58) 

k 

and let us assume that it is only infinitesimally different from Ri{t), by as- 
suming that both tick-by-tick returns are identical {r2^k = '^i,k, V k), while the 
instants {t2,fc} of the second return process are only infinitesimally different 
from the instants {ti./t} of the first return process: 

t2,k = h,k + C, y k . (59) 
The time shift ( ^ is supposed to be infinitesimal so that we can write 

R2{t) = J2rk5{t-tk-C) = Ri{t-C) , (60) 

k 

together with 

Ri{t) = ^rk 5{t - tk), 4 := ti.fe, rk := ri^k = r2,k ■ (61) 

k 

In this case, contrary to equality (1571) . and notwithstanding the "infinitesimal 
difference" between Ri(t) and R2(t), the following equality holds 

E[R,{t)R2{t)] = . (62) 

This suggests that the resolution of the paradox rests on the fact that the 
so-called infinitesimal difference or shift of the tick-by-tick instants drasti- 
cally changes the correlation between the different instantaneous returns, 
"reducing infinity to zero" . 
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The same conclusion holds qualitatively for A-scale returns defined over 
a finite time interval A. Indeed, let us go from the instantaneous singular 
returns ( 16T|) . (l60l) to the regular returns 



RiA (t) = ^ rfc Ha (t - tfc) , R2A (^) = Ha (t - tfc - C) , (63) 

k k 

where C is a nonzero shift. Figure 13 shows a realizations of these two A-scale 
return processes for ^ > A, where both ( and A are significantly smaller than 
mean inter-trade time duration f. Although the A-scale returns -Ri,a(^) and 
i?2,A(i) are almost identical, their product is equal to zero 

i?l,AW-^2,A(t) = 0, (64) 

which is in accordance with the Epps effect, suggesting that the function 
^1,2 (A) dSH) is vanishing at A 0. 

Figure 14 shows another realization of the two A-scale return processes 
([nS]) now for < A, which implies that 

RiAit) ■ R2Aii) = RlA-(it) ^ . (65) 

Accordingly, as can be seen in figure 14, the function Di 2{A) (15T]) is equal 
to 

A,(A)4E[i?Lc(t)]--.^, (66) 
A > C, A < f , 
and increases (for A > () with A increasing, in accordance with Epps effect. 

Assertion 3 In sum, the Epps effect is stipulated by the (random) shifts 
between respective trades instants ti^k (md t2,k of different assets. Due to 
these shifts, the smaller the scale A of returns Ri,A{t) and -R2,a(^); the less 
likely it is that trade instants ti^k o-nd t2,fc belong to the same time interval 
(t — A,t). As a result, the A-scale returns RiAif) <ind -R2,a(^) become less 
correlated, thus giving rise to the Epps effect. The quantitative description of 
the Epps effect in the framework of our model of returns is considered below. 



5.3 Epps effect: description in the frame of microstruc- 
ture returns model 

To quantify the Epps effect in our microstructure returns model, we cal- 
culate the two functions L'i_2(A) (IFIi) and iS^'^ For this, we need to 
estimate the cross correlations of the instantaneous returns 

Ki,2{r) ■.= E[R,{t)R2{t + T)] . (67) 
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We consider the simplest model representing the inter-dependence between 
Ri{t) and R2it), which reads 

Ri{t) := R{t), R^it) := R{t + () , (68) 

where R{t) is some instantaneous return process and ( is some random delay 
time distributed according to some known probability density function k{t). 
If C is always positive, R2(t) can be said to be subordinated to Ri(t) but 
we do not specifically need this to hold true. We do impose however that 
the inter-dependence between the two return processes is symmetric, which is 
reflected into the evenness of the probability density function: h{—t) = k,{t). 

Using relations fl68l) and equality fl23|l for the correlation function K{t), 
we obtain 

K,^2{r) := K{t) ® K{r) = 62 [/s:(r) + B{t) ® h{t)] . (69) 

We can then represent the function Di^2{^) (ISTl) in a form analogous to 
relation fH3|) : 

D,4A) = Hr) + B{r) ® k{t)] U{T)dT . (70) 

To make the analytical calculations explicit, we assume for definiteness 
that the probability density function of the random delay time C, is Gaus- 
sian with zero mean and variance A^. In our numerical calculations, we use 
relation (IB.lSp which is equivalent to expression fl70l) . 

Figure 15 shows the dependence of the function S]^ defined in fl53|l as 
a function of A, clearly demonstrating the existence of the Epps effect in 
the frame of our microstructure returns model. We do not present plots for 
different a and q values because our calculations show that the dependence of 
S]^ as a function of A is practically undistinguishable for different a values 
and for any g > 0. 
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A Statistics of tick-by-tick returns 



In this appendix, we give a detailed description of our tick-by-tick mi- 
crostructure return model and derive its basic statistical properties. 



A.l Statistical properties of ARFIMA process 

We explore the statistical properties of the tick-by-tick returns {r^} ([7]), 
beginning with a discussion of the correlation properties of the ARFIMA 
process {X^}. 

It is known that the solution of equation is 



oo 



Its correlation function is equal to 



oo 



j=0 

where the {ctjj's are defined in flA.ll) . Calculating the sum in ( ]A.2p and 
choosing the factor T in flA.ll) such that pq = 1, we obtain 



r(l-d) T{d + m) 



r{d) r{l -d + m)' 
Using the well-known asymptotical relation 
r{d + m) 



(A.3) 



T(l-d + m) 



m"", a = l-2d , m > 1, (A.4) 



we conclude that pm is asymptotically a power law. 

For fractional orders d G (0, 1/2) of the difference lag equation ([H]), the 
expansion flA.4p holds accurately even for small m values [m > 1). With 
good approximation, one may thus express pm (1A.3I) by 

1 m = T(^) 

r{a):=--T^, a = l-2d. (A.5) 

Figure Al illustrates the accuracy of the power law approximation (lA.Sp 
transposed in the main text as expression ( 1T2|) . One can observe a relative 
error of no more than 1.4% even for m = 1. 
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Moreover, it follows from expressions flA.Sp that, for any m ^ 1, 
Pm ^ Pi{d) :-- 



sin(7r(i) T'^{d) 



(A.6) 



Figure A2 shows the dependence of pi{d) as a function of d. For any m ^ 1 
and (i G (0, 1/2), the correlation function pm satisfies the inequality 



O^p^^l/2, m ^ 1, rfe (0,1/2). 



(A.7) 



A. 2 Correlation of the bid-ask bounce factor 

The correlation of the tick-by-tick returns {r^} is significantly influenced 
by the correlation Cm '■= E [Mk Mk+m] of the bid-ask bounce factor Mk 
defined by expression (fTOj) . Using the fact that 

^ ^^(^^^^ ^^^^ _ = ^ , (A.8) 

s=k 

we obtain 

= E [(-1)^«('='"^)] . (A.9) 

Assuming that the sequence {ls} consists of iid random Bernoulli integers, 
equal to one with probability p and zero with probability g = 1 — p, this 
implies that S^{k, m) is, for a given m, a Binomial variable such that 



PT{6^{k, m) = s} = 




s = 0,l,...,m. (A.IO) 



Accordingly, the correlation Cm is equal to 

C^ = (2g-1)"^, (A.ll) 
which may be rewritten in the form f[T^ . 



A. 3 Statistics of the denominator Hk in expression ([7j) 

The iid positive random variables{iffc} have been introduced in the defi- 
nition of our microstructure return process in order to adjust the statistical 
properties of the tick-by-tick returns ([7]) to those of the empirical tick-by-tick 
returns. 

In order to determine what should be their distribution ip{ri), we rewrite 
equality ([7]) in the more convenient for our subsequent analysis: 

rk = Yk/Hk, Yk = XkMk. (A. 12) 
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From the structure of M^'s, we conclude that the probabihty density function 
of each numerator is, hke X^, possess by Gaussian probabihty density 
function ip{y) with zero mean and unit variance. 

We specify the probabihty density function fit]) of the H^s so that the 
distribution of tick-by-tick returns is a fat-tail distribution similar to the 
empirical returns (see, for instance, [TUB]) of the form 

0(r) ~ \r\-''-\ r ±00. (A.13) 

Thus, we need in probability density function 0(r) of tick-by-tick returns 
{rfc} (lA.12p . possessing by the same power tail. We suggest below that fi > 9 
in order for the moments 

ee:=E [H^'] < 00 (A. 14) 

of given order ^ > to remain finite. 

One can prove rigorously that a sufficient condition for the probability 
density function 

poo 

Lp{ri)ip{rri)ridri (A. 15) 



of the tick-by-tick returns {r^} flA.12p to possess a power law flA.lSp is 

i^{n) = r/'^-i$(r/), r]>0, (A.16) 

where $(?]) is nonnegative continuous function, positive at 77 = ($(0) > 0), 
and decaying, at 77 — )■ 00, faster than ri~^. A convenient candidate for (p{i]) 
is 

¥>iv) = n /ON V^~' exp ---^ . (A.17) 



2/^/2r(/i/2) 

In this case, the probability density function 0(r) of the tick- by-tick returns 
is equal to the Student distribution ([Hj). Correspondingly, the moments eq 
flA.14p are given by the expression 

A. 4 Calculation of the correlation of absolute values 
of returns 



In order to calculate the autocorrelation function (fT7|) of the absolute 
values of returns, notice that Ir^f = \Xk\^ /H^, as it follows from (lA.12p . 
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Accordingly, one has 



E [\rk\' \rk+J] 



e2eJ^{0,l), m = 0, 
elJ^{6,pm), m ^ 0, 

[KIT =^^mo), 



(A.19) 



where Se is given by flA.181) . while 



E ix,nx, 



k+m\ 1 



(A.20) 



After substitution flA.lQp into f|T7|) . we obtain 

6gJ-[ti,Pr. 



p< 1, 



(A.21) 



where 

:=^(^^,p)-^(^,0). (A.22) 

To calculate the function J^{9,p), we take into account that the joint 
probability density function of the Gaussian variables and X^+m is equal 
to 

1 



ip{XuX2;pr 

and we obtain 



exp 



Tie,p) 

■k/2 / ^ 



T{i + e) 



2(1 -pD 



(A.23) 



TT 



+ 



(l-r)^"'x 



'l + psinM)^+^ (1— psin-u)^" 
In particular, for 6* = 1, one has 

2 r 



(A.24) 



sin udu. 



^-"(1, p) = — a/1 — p^ + p arcsin p 



(A.25) 



The correlation p^ in its attainable range flA.7p is well approximated 
by the first nonzero term of the Taylor expansion with respect to p of the 
function F{0,p) flA.22jl . flA.24D . which is given by 

„ . ^mm,2 



Q{e,p) = gep' 



r(f) 



(A.26) 
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The accuracy of the above quadratic approximation is illustrated in figures A3 
and A4. In particular, figure A3 shows that the quadratic approximation 
Q{0,p) (1A.26|) and the function J^{6,p) (1A.22I) are almost indistinguishable. 
Figure A4 plots the ratio of these two functions in order to provide a quanti- 
tive estimate of the relative errors. In the worst case for 9 = 0.5 and p = 0.5, 
we observe a maximum 5% relative error. 

Replacing in (lA.2ip the function ^{9, pm) by its quadratic approximation 
flXMj) . we obtain 



Substituting in f lA.27|) the power law approximation f lT2|) of the autocorre- 
lation function p^ in place of the variable p, we obtain the power law flTS]) . 
where the factor x is given by 




m ^ 1 . 



m = 0, 



(A.27) 




(A.28) 
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B Statistical description of instantaneous re- 
turns 



B.l Correlation function of instantaneous returns 



We outline in this subsection one of the possible derivations of relation 
fl2T]) for the continuous part B{r) of the correlation function K{t) fl20|) of 
the instantaneous returns R{t) (jl]). 

Notice, first of all, that 



U62 B{t) = K{t) Vr ^ 0. 



(B.l) 



Using definition ^ together with the ordering rule ([5]) of the trade instants 
{tfc}, we rewrite relation ( 1B.1|) in the form 



m=l 



Vr > 0, 



(B.2) 



where {Bm} are the correlations f|T6|l of the tick-by-tick returns. We keep in 
mind that, within the assumptions of our microstructure return model, the 
sequences {r^} and {tk} are mutually statistically independent. In view of 
identity fIB.ip . we rewrite f lB.2p in the form 



vB{t) = Y,^^^ 



m=l 



^ - tk)6{t - tk+m + t) 



Vr > 0. 



(B.3) 



Applying the linear operator 



T 



...dt 



(B.4) 



to both sides of equality (IB.3P and using the commutativity between linear 
operations and statistical expectations, we obtain 



m=l 



1 

7f ^(^ - ik)Sit - tk+m + r)dt 

^ k=l 



Vr > 0, 
(B.5) 

where N{T) is the random number of instants {tk} belonging to the time 
interval t G (0,T). After integration, one has 



z/5(r) = ^S^E 



m=l 



^ N{T) 



T)dt 



k=l 



Vr > 



(B.6) 
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Consider the limit of the last relation for T — ?► oo. Using the Law of 
Large Numbers, one may replace, at T — )■ oo, the random number N{T) by 
its expected value E [A^(T)] = z/T. This yields 

uB{r) = V i3„ lim \5{r - r^,™)] , Vr > 0, (B.7) 

m=l 

where we have used notation Using the fact that the expectation in (IB.7P 
gives by definition the probability density function of the random interval 
Tk,m, fm{r) ■■= E [6{t - rk,m)], we finally obtain 

oo 

B{T) = Y,Bmfm{r) Vr>0. (B.8) 

m=l 

Taking into account the evenness of the correlation functions B^s, we obtain 
relation fl2T]) . 



B.2 Laplace images of some generalized gamma distri- 
butions 

We present analytical expressions of the Laplace images of generalized 
gamma distributions. 

In the case /3 = 1/2, one has 

f{s) = 91/2 f A ' ^i/2(^) = 2^9(1 + 2t9), (B.9) 



and 



21(2^9)5''^ si V 2/ V2 2 '4s 



1 v^(-l)''-nAQ , ^ A , 2A; + 1 1 

(B.IO) 

Here $(a, 6, 2;) is the Kummer function (confluent hypergeometric function). 
In the case /3 = 1/3, one has 

f{s) = ^1/3 {j^-^ ^ ) ' ^1/3 W = 3^(1 + 3^)(2 + 3^), (B.ll) 

and 



^ k {2k + lf + 7 41- (2fc-5)2 1 

1H2 { V + 



(B.12) 



3 24 ' 24 27s 
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Here pHgla; b; z) is the generalized hypergeometric series, while a and h are, 
correspondingly, vectors of lengths P and Q. 

An analogous analytical expression for the Laplace image f{s) of the 
probability density function /(r) taking the form of the generalized gamma 
distribution with /3 = 2/3 is given by 



and 



2 " 2 ' " 24 ' 24 ' 27s2 

(6A;-5)2 + 47 A;(13 - 3A;) 
' 144 ' "^(fc) := • 

(B.14) 



B.3 Spectral representations of the expectations of quadratic 
returns 

In the numerical calculations of the correlation functions of returns, we 
use relations that express the correlation functions through the corresponding 
power spectra. For instance, we calculate the continuous part B{t) of the 
correlation function K{t) of the instantaneous returns R{t) by using the 
inverse Fourier transform 



1 roo 
B{r) = - 

TT .,0 



B{uj) cos{uT)du. 



(B.15) 



Analogously, using the fact that the spectrum K^{u) of the correlation func- 
tion Ka{t) (138|) is given by 



Ztc 



4 



1 + 5M 



7a(w) := 2 / 7a (t") cos (a;r)(ir =— sin --- 



00^ 



(B.16) 



we obtain the following relation 



71 



1 + B(uj) 



cos{uT)du 



(B.17) 
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which is convenient for numerical calculations. Similarly, we have used in 
our numerical calculations the relation 



poo 


1 + B{uj) 


Jo 




m, 


where 7a 



k{uj)T/\{uj)duj, 



(B.18) 



k{uj) = 2 / k{t) cos{ujT)dT = exp 



(B.19) 



29 



References 



[1] Ait-Sahalia, Y., Yu, J., 2009, High Frequency Market Microstructure 
Noise Estimates and Liquidity Measures. The Annals of Applied Statis- 
tics, 3, 422-457 

[2] Andersen, T. G., Bollcrslcv, T., Diebold, F. X., Labys P., 1999, Realized 
Volatility and Correlation. 

[3] Bacry, E., Delattre, S., Hoffmann, M., Muzy, J.-F., 2010, Modelling 
microstructure noise with mutually exciting point processes. Submitted 
to Quantitative Finance 

[4] Bandi, F. M., Russel, J. R., 2006, Separating microstructure noise from 
volatility. Journal of Financial Economics, 79, 655-692 

[5] Bandi, F. M., Russel J. R., 2008, Microstructure Noise, Realized Vari- 
ance, and Optimal Sampling. Review of Economic Studies, 75, 339-369 

[6] Beran, J., Statistics for Long-Memory Processes, Chapman & Hall/CRC 
Monographs on Statistics & Applied Probabihty (1994) 

[7] Bouchaud, J-P., Potters, M., 2006, Financial Risk and Derivative Pric- 
ing (Cambrdidge, UK: Cambridge University Press) 

[8] Cont, R., 2001, Empirical properties of asset returns: styhzed facts and 
statistical issues. Quantitative Finance, 1, 223-236 

[9] Cont, R., Stoikov, S., Talreja, R., 2010, A Stochastic Model for Order 
Book Dynamics. Operations Research, 58, 549-563 

[10] Ding, Z., Grander, C. W. J., Engle, R. F., 1993, A long memory property 
of stock market returns and a new model. Journal of Empirical Finance, 
1, 83-106 

[11] Epps, T. W., 1979, Comovcments in Stock Prices in the Very Short Run. 
Journal of the American Statistical Association, 74, 291-298. 

[12] Miinnix. M.C., R. Schafer and T. Guhr, 2011. Statistical causes for the 
Epps effect in microstructure noise. International Journal of Theoretical 
and Applied Finance 14 (8), 1231-1256 

[13] Granger, C. W. J., Joyeux, R., 1980, An Introduction to Long-Memory 
Time Series Models and Fractional Differencing. Journal of Time Series 
Analysis, 1, 15-29. 



30 



[14] Hosking J. R. M., 1981, Fractional Differencing. Biometrica, 68, 165-176 

[15] Ivanov, P. Cli., Yuen, A., Podobnik, B., Lee, Y., 2004, Common scaling 
patterns in intertrade times of U. S. stocks. Phys. Rev. E, 69, 056107- 
1-7 

[16] Miinnix, M. C, Schafer, R., Guhr, T., 2010, Impact of the tick-size on 
financial returns and correlations. Physica A, 389, 4828-4843 

[17] Politi, M., Scalas, E., 2008, Fitting the Empirical Distribution of Inter- 
trade Durations. Physica A, 387, 2025-2034 

[18] Rhee S. C, Wang, C.-J., 1997, The Bid-Ask Bounce Effect and the 
Spread Size Effect: Evidence from the Taiwan Stock Market. Pacific- 
Basin Finance Journal, 5, 231-258 

[19] Sazuka, N., 2007, On the gap between an empirical distribution and an 
exponential distribution of waiting times for price changes in a financial 
market. Physica A, 376, 500-506 

[20] Scalas, E., 2007, Mixtures of compound Poisson processes as models of 
tick-by-tick financial data. Chaos Solutions & Fractals, 34, 33-40 

[21] Toth, B., Kertesz, J., 2009, The Epps Effect Revisited. Quantitative 
Finance, 9, 793-802 

[22] Voev, v., Lunde, A., 2007, Integrated Covariance Estimation using 
High- Frequency Data in the Presence of Noise. Journal of Financial 
Econometrics, 5, 68-104 

[23] Zhang, L., Mykland, P.A., and AiVSahaha, Y., 2005, A Tale of 

Two Time Scales: Determining Integrated Volatility with Noisy High- 
Frequcncy Data. Journal of the American Statistical Association, 100, 
1394-1411 



31 




32 



10° 



10^ 



10* 10' 

m 



10^° 



Fig. 2: Exact dependence of the autocorrelation function ^m(^) (fT7|l . 
for n = 4, a = 0.2 and 9 = 0.5; 1; 1.5, illustrating the high accuracy 
of the approximate power law (|18p qualified as a straight line in this 
log-log plot. 
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Fig. 3: Dependence as a function of 6 of the normalized correlation 
function (fT9]l for different values of the exponent /i of the Student 
distribution </)(r) p^ . 
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Fig. 4: Representation along the time axis of the time sequences 
{tk} of trades with = X]i=i''"i) where the inter-trade intervals 
Tj's are drawn from the Weibull distribution with shape parameters 
/3 = 0.5; 1; 2 (top to bottom). The corresponding values of the scaling 
parameter A^(/3) are given by expression (|32p . The value (3 = 1 corre- 
sponds to pure Poisson memoryless sequences {tfe}. For (3 = 0.5, one 
can observe some events clustering. In contrast, for (3 = 2, the time 
sequence is more regular, akin to quasi-periodic. 
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Fig. 5: Survival functions of the Weibull distribution (f3T|) for /3 = 0.8 
(lower curve) and of the generalized gamma distribution (j34p , for the 
same i} = 0.8 and for (3 = 1/3, 1/2 and (3 = 2/3 (first, second and 
third curves from top at the right). 
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Fig. 6: Comparison of the function B{uj) (|29|) (top curve) obtained 
for a Poissonian statistics of the tick- by-tick times {t^} ()36p to the 
two functions B(u}) for the GGDs for /3 = 1/2; 2/3 (first and second 
curves from bottom), for the same i? = 0.8. The other parameters are 
Q = 0.1, /i = 4 and q = 0.1. 
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Fig. 7: Normalized correlation function K/^(t)/ Ki\(0) of the A- 
scale returns R^it) in calendar time, for A = 1 and q = 0; 0.1; 0.2; 0.3 
(bottom to top). The other parameters are a = 0.1,/x = 4, ?? = 
0.8 and 13 = 2/3. It is clearly seen that, regardless of the value of 
the bounce distortion probability q {q < 0.5), the returns R/\{t) in 
calendar time are negatively short-ranged correlated for r > 1. 



38 




Fig. 8: Same as figure 7 for different values of A. Left panel: A = 
1;2;3 (left to right) and q = 0. Right panel: A = 1;2;3 and q = 0, 
depicted in the calendar time scaled to A. 
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Fig. 9: Dependence of the microstructure noise strength 5 as a func- 
tion of the exponent a, the exponent of the power law correlation (jl2p 
of the auxiliary ARFIMA process {X^^. Here, fi = 4. Top to bottom: 
q = 0; 0.1; 0.2; 0.3; 0.4; 0.5 
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Fig. 10: Dependence of the microstructure noise strength 5 as a 
function of the bounce distortion probabiUty q, for = 5. Top to 
bottom: a = 0; 0.1; 0.2; 0.3; 0.4; 0.5 
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Fig. 11: Dependence of the microstructure noise strength 5 as a 
function of the exponent /U, the exponent of the power law describing 
the tail of the probability density function (f){r) ()14p of the tick-by-tick 
returns. Here, a = 0.2. Top to bottom: q = 0; 0.1; 0.2; 0.3; 0.4; 0.5 
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Fig. 12: Dependence of the generalized strength 5a defined by (jl9|) 
of the microstructure noise effect as a function of the interval duration 
A over which the returns i?A(i) are defined, for a = 0.1 (o" = 0.2), 
^ = 5 and for q = 0; 0.1; 0.2; 0.3; 0.4 (top to bottom). 
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Fig. 13: A realization of the A-scale returns i?i,A(i) and R2.A{t) 
(bold and thin lines correspondingly) for A < so that identity ()64p 
is true. 
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Fig. 14: A realization of the A-scale returns Ri^^t) and -R2,A(i) for 
A > so that equahty ([66]) holds. 
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Fig. 15: Demonstration of the Epps effect in our microstructure 
model with the dependence of the function 5^'^ defined in ()53p as a 
function of A, for a = 0.1 (cr = 0.2), ^ = 4 and q = 0. Top to bottom: 
A = 1;2;3. 



46 




0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 

d 



Fig. Al: (i-dependence of the ratio Pm/ Qm of the correlation function 
Pm and its power law approximation Qm for d G (0,0.5), illustrating 
the uniform accuracy of the approximation ()12p . 
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Fig. A2: Dependence of pi(d) defined in (|A.6p as a function of d. 



48 




Fig. A3: Comparisons of the function J-{0, p) ()A.22p with its quadratic 
approximation Q{9,p) ()A.26p as a function of p. Bottom to top: 
9 = 0.5;1;1.5. For each 0, the two functions are almost indistin- 
guishable. 
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Fig. A4: Ratio of the function J-{0, p) and its quadratic approxi- 
mation Q{9,p) as a function of p, for all possible pm {m > 0) values 
as given by condition (|A.7p . illustrating the accuracy of quadratic ap- 
proximation ()A.26p . Top to bottom: 9 = 0.5; 1; 1.5. 
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